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OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



July 27, 2005, 07:32:56 ; Search time 215.886 Seconds 

(without alignments) 
1479.779 Million cell updates/sec 

US-09-922-958-4 
4287 

1 MEGAGGANDKKKISSERRKE QGSRNLLQGEELLRALDQVN 826 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



2105692 



Searched: 2105692 seqs, 386760381 residues 

Total number of hits satisfying chosen parameters:" 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : A_Geneseq_16Dec04 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8: geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAW06557 

ID AAW06557 standard; protein; 826 AA. 
XX 

AC AAW06557; 
XX 

DT 19-MAR-1997 (first entry) 
XX 

DE Human hypoxia inducible factor- 1 alpha. 

XX 

KW Hypoxia inducible factor- 1 alpha; HIF-1; tissue damage; atherosclerosis; 

KW cerebral artery disease; therapy. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/ Qualifiers 



FT Domain 17. .30 

FT /label= DNA-binding_domain 

FT /note= "the DNA binding domain is deleted in engineered 

FT HIF-1 alpha variants" 

XX 

PN W09639426-A1. 

XX 

PD 12-DEC-1996. 
XX 

PF 06-JUN-1996; 96WO-US010251 . 
XX 

PR 06-JUN-1995; 95US-004 80473 . 
XX 

PA (UYJO ) UNIV JOHNS HOPKINS SCHOOL MED. 
XX 

PI Semenza GL; 
XX 

DR WPI; 1997-043061/04. 

DR N-PSDB; AAT45937. 
XX 

PT DNA encoding human hypoxia-inducible factor 1 alpha - useful for 

PT enhancing expression of structural gene and treatment of hypoxia-related 

PT tissue damage. 

XX 

PS Disclosure; Page 49-53; 95pp; English. 
XX 

CC The 120 kDa alpha subunit (AAW06557) of human hypoxia inducible factor-1 

CC (HIF-1) is a basic-helix-loop-helix polypeptide contg. a PAS domain whose 

CC expression is regulated by cellular 02 tension. It dimerises with a beta 

CC subunit to form HIF-1, a DNA-binding protein which binds to the enhancer 

CC region of e.g. erythropoietin and vascular endothelial growth factor 

CC genes. Its amino acid sequence was deduced from cDNA clones (see also 

CC AAT45937) obtd. from an Hep3B library. Recombinant HIF-1 alpha can be 

CC produced in transformed host cells and used to treat HIF-l-mediated or 

CC hypoxia- related disorders. Dominant-negative variant forms (see also 

CC AAW06558-59) of HIF-1 alpha have been engineered that form a 

CC nonfunctional HIF-1 complex 
XX 

SQ Sequence 826 AA; 

Query Match 100.0%; Score 4287; DB 2; Length 826; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 826; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

Qy 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

Qy 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 



Qy 



181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 

Db 181 T^WIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPP^4TCLVLICEPIPHPSNIEIPLDSK 240 

Qy 241 TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 241 TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

Qy 301 TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 

I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 301 TTGQYRMLAKRGGYVWVETQATVI YNTKNSQPQCIVCVNYWSGIIQHDLI FSLQQTECV 360 

Qy 361 LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I 
Db 361 LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 

Qy 421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 480 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 480 

Qy 481 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 

Qy 541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 17 I I I I I I I I I I I I I I I I I I I I 
Db 541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 

Qy 601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 660 

I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKI LIAS PS PTHIHKETT SATS SPYR 660 

Qy 661 DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 720 

Qy 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

Qy 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I M I I I I I 
Db 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 82 6 



RESULT 2 
AAW80418 

ID AAW80418 standard; protein; 826 AA. 
XX 

AC AAW80418; 
XX 

DT 14-JAN-1999 (first entry) 
XX 

DE Amino acid sequence of human hypoxia inducible factor 1 alpha. 
XX 

KW Human transforming growth factor-beta 3; TGF-beta3; oxygen tension; 

KW trophoblast invasion regulation; inhibitor; HIF-1 alpha; 

KW TGF-beta family cytokine receptor; hypoxia inducible factor 1 alpha; 



KW preeclempsia; pregnanancy; choriocarcinoma; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO9840747-A1. 

XX 

PD 17-SEP-1998. 
XX 

PF 05-MAR-1998; 98WO-CA000180 . 
XX 

PR 07-MAR-1997; 97US-0039919P . 
XX 

PA (MOUN ) MOUNT SINAI HOSPITAL CORP. 

PA (HOSP-) HOSPITAL FOR SICK CHILDREN. 
XX 

PI Caniggia I, Post M, Lye S; 
XX 

DR WPI; 1998-520837/44. 

DR N-PSDB; AAV63210. 
XX 

PT Regulation of trophoblast invasion - by, e.g. transforming growth factor- 

PT beta3 inhibitor, useful for detecting or treating preeclempsia in 

PT pregnant women. 
XX 

PS Disclosure; Fig 2; 59pp; English. 
XX 

CC The present sequence encodes human hypoxia inducible factor 1 alpha (HIF- 

CC 1 alpha) . The specification describes a composition for regulating 

CC trophoblast invasion which comprises an inhibitor of transforming growth 

CC factor-beta 3 (TGF-beta3) , TGF-beta family cytokine receptors, HIF-1 

CC alpha or oxygen tension. The composition is used in methods of 

CC diagnosing, monitoring, preventing or treating conditions requiring 

CC regulation of trophoblast invasion, especially preeclempsia in pregnant 

CC women or choriocarcinomas 

XX 

SQ Sequence 826 AA; 

Query Match 100.0%; Score 4287; DB 2; Length 826; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 826; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I 

Db 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

Qy 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

Qy 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 



Qy 

Db 



181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 



241 TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

301 TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 

I II II I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 

361 LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLIiAPAAGDTIISLDFGSNDTET 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I 
361 LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 

421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 480 

I I I I I I I I I I II I I I I I I I I I I I II I I II II I I I I II I I I I I I I I I II I I I I I I I II I I I 
421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 480 

481 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I 
481 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 

541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 

601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 660 

661 DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 720 

I I I I I M I I I I I I I I I I I I II I I I II I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I 
661 DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 720 

721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

7 81 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 82 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
7 81 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 82 6 



us-09-922-958-4.rapb 



GenCore version 5.1.6 
copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

scoring table: 



searched: 



July 27 f 2005, 07:53:38 



Search time 203.684 Seconds 
(without alignments) 
1577.480 Million cell updates/sec 



US-09-922-958-4 
4287 

1 MEGAGGANDKKKISSERRKE QGSRNLLQGEELLRALDQVN 826 



BLOSUM62 
Gapop 10.0 



Gapext 0. 5 



1741741 seqs, 388992284 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1741741 



Database 



Pub! i shed^ppl i cati ons_AA : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 



/cgn2„6/ptodata/l/pubpaa/US07_PUBCOMB. pep: * 
/cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/us06_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/us06_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB . pep : 
/cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB. pep: * 
/cgn2_6/ptodata/l/pubpaa/US09/UPUBCOMB . pep : 
/cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/us09C_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/us09_NEW_PUB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0A_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0B„PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0C_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0D_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0E_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/usl0_NEW_PUB . pep : 
/cgn2_6/ptodata/l/pubpaa/usllA_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB . pep 
/cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB . pep 
/cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB . pep 



: * 
: * 
: * 
: * 
: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. Score Match Length DB ID 



SUMMARIES 



Description 
Sequence 4, Appli 



4287 100.0 



826 9 US-09-922-958-4 
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Sequence 235, App 
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US-09-736-457-330 


Sequence 330, App 
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100. 
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US-09-902-941-330 


Sequence 330, App 
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4287 


100. 
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826 
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US-09-849-626-330 


Sequence 330, App 
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4287 


100. 
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826 


10 


US-09-967-388-4 


Seauence 


4 Addi i 
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4287 


100. 
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826 


10 


US-09-476-300-330 


Seauence 


330 Add 

*j \J , /V yj yj 
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4287 


100. 
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US-10-028-158-23 


Seauence 
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4287 


100. 
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US-10-101-812-10 


Seauence 
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100. 
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US-10-101-662A-9 


Seauence 
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US-10-017-754-330 


Seauence 


330 Add 

*j *** v | ) \ y* yj 
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4287 


100. 
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826 
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US-10-115-987B-14 


Seauence 


14 AddI 


13 


4287 


100. 
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826 
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US-10-287-670-9 


Sequence 


9 ADDli 


14 


4287 


100. 


0 


826 
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US-10-113-872-330 


Sequence 


330 Add 

2 ADDli 


15 


4287 


100. 


0 


826 


14 


US-10-423-419-2 


Sequence 


16 


4287 


100. 


0 


826 


15 


US-10-283-017-330 


Sequence 


330 Add 

•J -j v , /» y* y* 

1 ADDli 


17 


4287 


100. 


0 


826 


16 


US-10-032-361-1 


Sequence 


18 


4287 


100. 


0 


826 


16 


US-10-859-935-10 


Seauence 


10, Appl 


19 


4287 


100. 


0 


826 


16 


US-10-859-935-23 


Sequence 


23. AddI 


20 


4287 


100. 


0 


826 


17 


US-10-901-583-18 


Sequence 


18. AddI 


21 


4287 


100. 


0 


826 


18 


US-10-472-595A-35 


Sequence 


35, Appl 


22 


4287 


100. 


0 


826 


20 


US-11-043-493-23 


Sequence 


23, Appl 


23 


4282 


99. 


9 


826 


14 


US-10-101-816-5 


Sequence 


5 , Appl i 


24 


4282 


99. 
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826 


16 


US-10-854-483-5 


Sequence 


5, Appli 


25 
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99. 
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826 


14 


US-10-101-816-2 


Sequence 


2 , Appl i 


26 


4279 


99. 


8 


826 


16 


US-10-854-483-2 


Sequence 


2 , Appl i 


27 


4274 


99. 
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826 


14 


US-10-101-816-6 


Sequence 


6 . AddI i 

\j | 9 \ y* y* ■ * 


28 


4274 


99 
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826 


16 


US-10-854-483-6 


Sequence 


6. ADDli 


29 


4273. 5 


99 
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827 
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US-09-919-039-149 


Sequence 


149. ADD 

I | 9 \ Y* Y+ 


30 


4273.5 


99 
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827 


14 


US-10-247-671-137 


Sequence 


137. ADD 

^ t | 9 \ y* J-* 

7, Appli 


31 


4266 


99 


5 


826 


14 


US-10-101-816-7 


Sequence 


32 


4266 


99 


5 


826 


16 


US-10-854-483-7 


Sequence 


7 , Appl i 


33 


3814 


89 


.0 


735 


16 


US-10-032-361-2 


Sequence 


2, Appli 


34 


3803 . 5 


88 


,7 


823 


14 


US-10-205-342-13 


Sequence 


13, Appl 


35 


3797 


88 


.6 


836 


18 


US-10-988-486-6 


Sequence 


6, Appli 


36 


3135 . 5 


73 


,1 


623 


10 


US-09-967-388-2 


Sequence 


2, Appli 


37 


2716 


63 


.4 


542 


15 


US-10-264-049-2606 


sequence 


2606, Ap 


38 


1764.5 


41 


.2 


870 


14 


US-10-121-235-6 


Sequence 


6, Appli 


39 


1735.5 


40 


.5 


870 


14 


US-10-101-816-4 


Sequence 


4, Appli 


40 


1735.5 


40 


.5 


870 


16 


US-10-854-483-4 


Sequence 


4, Appli 


41 


1458 


34 


.0 


485 


14 


US-10-121-235-4 


Sequence 


4, Appli 


42 


1182 . 5 


27 


.6 


632 


15 


US-10-425-833-10 


Sequence 


10, Appl 


43 


1132.5 


26 


.4 


705 


14 


US-10-154-386-2 


Sequence 


2, Appli 


44 


1104 


25 


.8 


595 


15 


US-10-425-833-7 


Sequence 


7, Appli 


45 


906 


21 


.1 


333 


9 


US-09-925-302-659 


Sequence 659, App 



ALIGNMENTS 



RESULT 1 
US-09-922-958-4 

; Sequence 4, Application US/09922958 
; Patent No. US20020048794A1 
; GENERAL INFORMATION: 

; APPLICANT: POELLINGER, Lorenz 

; APPLICANT: PEREIRA, Teresa 

; APPLICANT: RUAS, Jorge 

; TITLE OF INVENTION: MECHANISM OF CONDITIONAL REGULATION OF THE HYPOXIA-INDUCIBLE 
FACTOR- 1 BY 

; TITLE OF INVENTION: THE VON HIPPEL-LINDAU TUMOR SUPPRESSOR PROTEIN 

; FILE REFERENCE: 3743/49008 

; CURRENT APPLICATION NUMBER: US/09/922 , 958 

; CURRENT FILING DATE: 2001-08-07 
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us-09-922-958-4.rapb 
; PRIOR APPLICATION NUMBER: US 60/223,480 
; PRIOR FILING DATE: 2000-08-07 
; NUMBER OF SEQ ID NOS: 7 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 4 
; LENGTH: 826 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-922-958-4 



Query Match 100.0%; score 4287; DB 9; Length 826; 

Best Local Similarity 100.0%; Pred. No. 3.1e-289; 

Matches 826; conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MFGAGGANnKKKTS^FRRKFK^RnAAR^RR^KF^FVFYFI AHOI PI PHNV^HI nKAWM 


KJ\J 


Qy 


61 


RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 


120 




1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 




Db 


61 


RLTISYLRVRKLLDAGDLDIEDDMKAOMNCFYLKALDGFVMVLTDDGDMTYTSDNVNKYM 

1 \ l— 1 Am 1 i— 1 \ V IX IX 1— 1— U//VXJ L-/ I— Lf/ ^ ^ Li/ L/ 1 * 1 IXj\V^I * 1 1 \w 1 1 1— IX/a L_ \J VJ | VI'IV L 1 U L/ V)LSI*I J- I X J VJ IX V ll IX 1 1*1 


120 


Qy 


121 


GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


Gl TOFFI TGHSVFDFTHPCDHFFMRFMI THRNGI VKKGKFONITORSFFI RMKCTI T^RGR 
vj l_ i v{ i i— i— i vji u v rur ■ \ i r v— vj\ ii_i_iir\i_iii_ i 1 1 r\ i n vj i_ v ix ix vj ix l_ v^i i i v^r\.j i i i_ r\i ¥ irxv_ i i_ i j r\vj i\ 


1 RO 


Qy 


181 


TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


181 


TMNIKSATWKVLHCTGHIHVYDTNSNOPOCGYKKPPMTCLVLICEPIPHPSNTETPL DSK 

i imi n j. ix jn i vv ix v i— i i >w i vj i i j. i i v i \j i ix^i nv^r i ix ix i i i * i i v— i_ v l_ j. v- i_ r j>. t iir / 1 -L. i— jl i l. \j ~j ix 


240 


Qy 


241 


TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


741 

L. *T_1_ 


TFI ^RH^I DMK F^YmFRTTFI M^YFPFFI 1 flR^TYFYYHAI n^hHI TKTHHPiMFTKCIOV 

1 1 L Jl\n JLLTIIXi J I v.LSCf\X 1 C 1_ 1 v l VJ I LrLLLLVSrVJl. I C 1 1 fl L/ ^ L/ li L_ 1 IX 1 nnL/l'lr 1 IX VJV^ V 


J\J\J 


Qy 


301 


TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 


360 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




TTGOYRMI AKRn^YVWVFTnATVTYNTKNSOPnrTVrVNYAA/^nTTriHni TF^I OOTFCV 


360 


Qy 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 


Qy 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 




III II IMIIMI II II III III II II II Mill MM II II II II II II II II II MM 




Db 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 


Qy 


481 


NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 




1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


481 


NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 


Qy 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 


600 




1 1 1 II 1 1 II 1 M 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I I 1 1 : 1 1 1 1 1 1 1 1 II 1 1 1 1 Ml 1 1 1 1 1 M ' 1 




Db 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 


600 


Qy 


601 


VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 


660 




1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


VFQQTQIQEPTANA 1 1 1 1 ATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 


660 


Qy 


661 


DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 




II 1 1 1 1 1 1 1 1 1 1 1 1 INI 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 il 1 1 1 1 1 ! 1 




Db 


661 


DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 


Qy 


721 


KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 


780 
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us-09-922-958-4.rapb 
1 1 1 M M I II 1 1 1 I II I II I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 II 1 1 1 1 II 1 1 

Db 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

Qy 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

1 1 1 1 I II I II I I II I 1 1 M I M I I I II I 1 1 1 1 1 1 II 1 1 I 1 1 1 1 1 1 1 

Db 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

RESULT 2 

US-09-833-790-235 

Sequence 235, Application US/09833790 
Patent No. US20020068288A1 
GENERAL INFORMATION: 
APPLICANT: Lodes, Michael J. 
APPLICANT: Wang, Tongtong 
APPLICANT: Secrist, Heather 
APPLICANT: Mohamath, Raodoh 
APPLICANT: indirias, Carol Y. 
APPLICANT: Fan, Liqun 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF LUNG CANCER 
FILE REFERENCE: 210121.512 

CURRENT APPLICATION NUMBER: US/09/833 , 790 
CURRENT FILING DATE: 2001-04-11 
NUMBER OF SEQ ID NOS: 440 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 235 
LENGTH: 826 
TYPE: PRT 

ORGANISM : Homo sapiens 
US-09-833-790-235 

Query Match 100.0%; Score 4287; DB 9; Length 826; 

Best Local Similarity 100.0%; Pred. No. 3.1e-289; 

Matches 826; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 
RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

I 1 1 1 1 I I 1 1 I 1 1 1 11 1 I I I II I I 1 1 1 1 1 1 1 1 1 1 I I I I I 1 1 Ml I I I 1 1 I II M II 1 1 1 1 

GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 
TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 

1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II I MM I 1 1 1 1 I II I 1 1 1 1 1 1 1 1 1 1 

TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 
TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

I I II 1 1 1 1 II M I I I I I 1 1 I I I Ml I I M M I II I M 1 1 1 1 II I II I I M 1 1 1 1 1 1 1 1 1 1 

TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 
TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 

I I 1 1 II 1 1 1 1 I M II I I 1 1 1 1 I I Ml 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I M I 1 1 1 1 I II II II II 1 1 

TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 
LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 

1 1 1 1 1 1 1 1 1 1 1 1 1 M I Ml 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 III I II 1 1 I Ml 1 1 1 1 1 1 1 1 II 

LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 
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Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 



us-09-922-958-4.rapb 
421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 480 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 1 II 1 1 1 I! 1 1! 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 M 

421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 480 
481 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 

1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 M I II MM 1 1 1 1 II 1 1 Ml I M 1 1 1 1 1 II I II I M I II 1 1 1! 

481 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 
541 AEDTEAKNIPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 

I M 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 II MM 1 1 1 1 1 1 1 1 M I M 1 1 1 1 1 1 II I II I Ml II 1 1 II 

541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 
601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 660 

I M 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 Ml 1 1 1 1 II 1 1 II I M 1 1 1 I II 1 1 1 M I M I II 1 1 II 

601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 660 
661 DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 720 

I M I 1 1 II 1 1 I IM M M I M I M M 1 1 1 1 I II I IM I II 1 1 1 1 M 1 1 M 1 1 1 1 Ml I 1 1 

661 DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 720 
721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

I M M I II 1 1 1 IM M 1 1 I M I M M 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 M 1 1 II 1 1 II II 1 1 II 

721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 
781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

IM M III III IM MM MM MM II II I III IMM III III I 

781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score; 
Sequence : 

Scoring table: 



July 27, 2005, 07:42:11 ; Search time 48.8091 Seconds 

(without alignments) 
1628.284 Million cell updates/sec 

US-09-922-958-4 
4287 

1 MEGAGGANDKKKISSERRKE QGSRNLLQGEELLRALDQVN 82 6 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283416 



Database 



PIR_79: * 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


4287 


100. 


0 


826 


2 


138972 


hypoxia- inducible 


2 


3764.5 


87. 


8 


813 


2 


JC5809 


hypoxia- inducible 


3 


3754 


87. 


6 


810 


2 


JC4837 


hypoxia- inducible 


4 


3406. 5 


79. 


5 


811 


2 


JC7619 


hypoxia- inducible 


5 


1165.5 


27. 


2 


667 


2 


JC7771 


hypoxia inducible 


6 


888.5 


20. 


7 


1505 


2 


JC4851 


hypoxia- inducible 


7 


570 


13. 


3 


655 


2 


A29945 


neurogenesis regul 


8 


512 


11. 


9 


823 


2 


T21943 


hypothetical prote 


9 


511 


11. 


9 


825 


2 


T21944 


hypothetical prote 


10 


441.5 


10. 


3 


248 


2 


A58520 


single-minded gene 


11 


422 


9. 


8 


776 


2 


A55448 


Ah receptor nuclea 


12 


413. 5 


9. 


6 


791 


2 


A56241 


aryl hydrocarbon r 


13 


397.5 


9. 


3 


626 


2 


JC5405 


brain and muscle A 



14 


396 


9. 


2 


626 


2 


JE0270 


Arnt- like* PA*-! nrnt" 


15 


396 


9. 


2 


846 


2 


JC7721 


arvl hvdroca rbon r 


16 


390. 5 


9. 


1 


789 


2 


159550 


aryl hydrocarbon r 


17 


390 . 5 


9. 


1 


805 


2 


A46266 


arvl hvdrora rbon r 


18 


377 . 5 


8 . 


8 


805 


2 


JC7635 


arvl hvHrnr^rhnn r 

a x. y j. 1 1 y \a x. x^* a x. xv ^ i i x. 


19 


370 


8 . 


6 


834 


2 


JC7993 


arvl h vH rnra rh r» n r 

ci i y x 1 1 y ui. w ^« u x, xj uil X. 


20 


362 . 5 


8 . 


5 


848 


2 


S59514 


a rul H vH rnrarhnn r 
cij-vx iiyuiuuciLJJUii x. 


21 


362 . 5 


8 . 


5 


853 


2 


S58375 


arvl h vH rnrarhnn r 

ui yx ii y \a i. <j\a>ai. xv w 1 1 x. 


22 


361 


8 . 


4 


920 


2 


JC7313 


ar\/l for*^ rhnn k 


23 


348.5 


8 . 


1 


1059 


2 


T30557 


a r v 1 h vH rnrs rhon r 


24 


346. 5 


8. 


1 


1058 


2 


T30556 


aryl hydro carbon r 


25 


338 


7. 


9 


358 


2 


S58376 


arvl hvdroca rbon r 


26 


338 


7. 


9 


1023 


2 


T13068 


CLOCK nrni"Pin — f r 


27 


336 


7. 


8 


1015 


2 


T13062 


CLOCK nrnrpin — f r 


28 


329 


7. 


7 


1027 


2 


T13071 


CLOCK nrotein — f r 


2 9 


315.5 


7. 


4 


392 


2 


JC7633 


Cl J- _y -L l L y KJL I. KJ K^CX i- XJ \J Ll 11 


30 


301 


7. 


0 


300 


2 


T24292 


hi vr* o t" Vi ^ t* i ra 1 nrnt*p 

11 V L> V_/ L-11C <— < _1_ Vw, d _1_ Ly J_ W L- *3 


31 


277 


6.5 


602 


2 


T19898 


a r\;1 Vi vH ror , arVion r 
cl i- y a. iiyui.uucij.jjun j_ 


32 


273.5 


6. 


4 


14 62 


2 


T42639 


rr 1 i]rnrnr1"i rni H r ^ p 


33 


269 


6. 


3 


1424 


2 


T03851 
j. \j *j \j *j j. 


i"hvrni H hnrnionp k 
Liiyiuiu nu x. iiiw lie x. c 


34 


268 


6. 


3 


647 


2 


T19307 


h vnot* h t i pal n rnfp 


35 


266. 5 


6. 


2 


650 


2 


G87883 


nrotein C41 G7 5 H 


36 


250 . 5 


5. 


8 


451 


2 


T42397 


a r vl hvdroca rbon r 

tl X. Jf J. J 1- w wC* 1. Jw* Ull X. 


37 


244.5 


5. 


7 


453 


2 


T19440 


h vn othet i pal n rnt"p 


38 


231 . 5 


5. 


4 


1016 


2 


T30990 


npri od nrotpi n — C 

C J L WU LCXU \^ 


39 


231. 5 


5. 


4 


1018 


2 


T30986 


period protein - C 


40 


216 


5. 


0 


1122 


2 


B26427 


period clock prote 


41 


216 


5. 


0 


1127 


2 


A25018 


circadian rhythm p 


42 


216 


5. 


0 


1176 


2 


C26427 


period clock prote 


43 


216 


5. 


0 


1218 


2 


A26588 


period clock prote 


44 


216 


5. 


0 


1218 


2 


A26427 


period clock prote 


45 


215.5 


5. 


0 


1208 


2 


S17286 


period clock prote 



ALIGNMENTS 



RESULT 1 
138972 

hypoxia-inducible factor 1 alpha - human 
N;Alternate names: ARNT interacting protein 
C; Species: Homo sapiens (man) 

C;Date: 23-Feb-1996 #sequence_revision 23-Feb-1996 #text_change 09-Jul-2004 
C;Accession: 138972; G01875 

R;Wang, G.L.; Jiang, B.H.; Rue, E.A. ; Semenza, G.L. 
Proc. Natl. Acad. Sci. U.S.A. 92, 5510-5514, 1995 

A; Title: Hypoxia-inducible factor 1 is a basic-helix-loop-helix- PAS heterodimer 
regulated by cellular 02 tension. 

A;Reference number: 138972; MUID : 95296340 ; PMID: 7539918 
A; Accession: 138972 
A; Status: preliminary 
A; Molecule type: mRNA 
A; Residues: 1-826 <RES> 

A/Cross-references: UNIPROT : Q16665-; EMBL:U22431; NID:g881345; PIDN : AAC50152 . 1 ; 
PID:g881346 

A;Note: parts of this sequence were confirmed by peptide sequencing 
R;Hogenesch, J.B.; Chan, W.K.; Carver, L.A. ; Bradfield, C.A. 



submitted to the EMBL Data Library, June 1995 
A; Reference number: H00692 
A; Accession : G01875 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-826 <HOG> 

A;Cross-references : EMBL : U29165 ; NID : gll44012 ; PIDN : AAC51210 . 1 ; PID:gll44013 

C; Genetics : 

A; Gene: GDB : HIF1A 

A;Cross-ref erences : GDB: 512229 

A; Map position: 14q21-14q24 

C; Keywords : heterodimer 



Query Match 100.0%; Score 4287; DB 2; Length 826; 

Best Local Similarity 100.0%; Pred. No. 8.2e-245; 

Matches 826; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 

Db 



MEGAGGANDKKKI S S ERRKEKS RDAARSRRSKES EVFYELAHQLPLPHNVS SHLDKASVM 60 
I I I I I I I I II I II i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MEGAGGANDKKKI SSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVS SHLDKASVM 60 



Qy 

Db 



61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

I I H I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I 
61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
■ Db 

Qy 
Db 

Qy 

Db 



121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 

I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 

241 TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 300 

301 TTGQYRMIAKRGGYVWETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 

I I I I I I I I I I I I I I! I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 TTGQYRMLAKRGGYVWETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 360 

361 LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 

I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
361 LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 420 

421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 4 80 

4 81 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 81 NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 540 

541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
541 AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 600 



Qy 601 VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 VFQQTQIQEPTANATTTTATTDELKT VTKDRMEDI KI LIAS PS PTHIHKETT SATS S PYR 660 

Qy 661 DTQS RTAS PNRAGKGVI EQTEKSH PRS PNVLSVALSQRTTVPEEELNPKI LALQNAQRKR 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 DTQS RTAS PNRAGKGVI EQTEKSHPRS PNVLSVALSQRTTVPEEELNPKI LALQNAQRKR 720 

Qy 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

Qy 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 



RESULT 2 
JC5809 

hypoxia-inducible factor 1 alpha - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 04-Feb-1998 #sequence_revision 13-Mar-1998 #text_change 26-Aug-1999 
C /Accession : JC58 09 
R;Ladoux, A.; Frelin, C. 

Biochem. Biophys. Res. Commun. 240, 552-556, 1997 

A;Title: Cardiac expressions of HIF-1 alpha and HLF/EPAS, two basic loop 
helix/PAS domain transcription factors involved in adaptative responses to 
hypoxic stresses. 

A;Reference number: JC5809; MUID : 98 063274 ; PMID: 9398602 
A; Access ion : JC58 09 

A; Status: nucleic acid sequence not shown 
A;Molecule type: mRNA 
A; Residues: 1-813 <LAD> 

C; Comment: This protein associates to the aryl hydrocarbon receptor nuclear 
translocator, binds to specific hypoxia responsive elements and activates the 
transcription of hypoxia sensitive genes such as erythropoietin, vascular 
endothelial growth factor, the GLUT1 glucose transporter, enzymes of gycolysis 
and the inducible form of nitric oxyde synthase. 
F; 6-144/Region : basic helix-loop-helix #status predicted 

Query Match 87.8%; Score 3764.5; DB 2; Length 813; 

Best Local Similarity 90.1%; Pred. No. 4.9e-214; 

Matches 737; Conservative 24; Mismatches 48; Indels 9; Gaps 4; 

Qy 13 ISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVMRLTISYLRVRKL 72 

: I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MSSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVMRLTISYLRVRKL 60 

Qy 73 LDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYMGLTQFELTGHSV 132 

! I I I I I I I I I : I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II 
Db 61 LDAGDLDIEDEMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYMGLTQFELTGHSV 120 

Qy 133 FDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGRTMNIKSATWKVL 192 

I I I I I I 1 I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 FDFTHPCDHEEMREMLTHRNGPVRKGKEQNTQRSFFLRMKCTLTSRGRTMNIKSATWKVL 180 



Qy 



193 HCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSKTFLSRHSLDMKF 252 



181 HCTGHIHVYDTS SNQPQCGYKKPPMTCLVLICEPI PHPSNI EI PLDSKTFLSRHSLDMKF 240 

253 SYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQVTTGQYRMLAKRG 312 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 SYCDERITELiMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQVTTGQYRMLAKRG 300 

313 GYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECVLKPVESSDMKMT 372 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
301 ' GYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTESVLKPVESSDMKMT 360 

373 QLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTETDDQQLEEVPLYN 432 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : I I I I I : I I I I I 
361 QLFTKVESEDTSCLFDKLKKEPDALTLLAPAAGDTIISLDFGSDDTETEDQQLEDVPLYN 420 

433 DVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEPNPESLELSFTMP 492 

III I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I Mill I I I I I I 
421 DVMFPSSNEKL-NINLAMSPLPASETPKPLRSSADPALNQEVALKLESSPESLGLSFTMP 479 

4 93 QIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLFAEDTEAKNPFST 552 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
480 QIQDQPASPSDGSTRQSSPEPNSPSEYCFDVDSDMVNVFKLELVEKLFAEDTEAKNPFSA 539 

553 QDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVTVFQQTQIQEPTA 612 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I I I II : I : I I 

54 0 QDTDLDLEMLAP YI PMDDDFQLRS FDQLS PLESNS PS P PSV STVTGFQQTQLQKPTI 596 

613 NAT TTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYRDTQSRTAS 668 

I II I I I I I I I I I : I I I I I I I I I I I : : I I : I : I I I I I I I I 

597 TVTAATATTATTTDESKAVTKDNIEDIKILIASPPSTQVPQEMTTAKASAYSGTHSRTAS 656 

669 PNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKRKMEHDGSL 728 

I : I I I I I I I : I : I : I I I I I III I : I I I I I I I I I I I I : I I I I I I I II I I I I I I I I 
657 PDRAGKRVIEKTDKAHPRSLN-LSVTLNQRNTVPEEELNPKTIALQNAQRKRKMEHDGSL 715 

72 9 FQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLACRLLGQSMD 7 88 

III I I I I I I I I I I I I I I I I I I I II I I I I : I I I I I II I I I I I I I I I I I I I I I I 
716 FQAAGIGTLLQQPGDRAPTMSLSWKRVKGYISSEQDGMEQKTIFLIPSDLACRLLGQSMD 775 

789 ESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I 
77 6 ESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 813 



us-09-922-958-4.rup 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



ON! protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



July 27, 2005, 07:33:41 ; Search time 213.07 Seconds 

(without alignments) 
1985.153 Million cell updates/sec 

US-09-922-958-4 
4287 

1 MEGAGGANDKKKISSERRKE QGSRNLLQGEELLRALDQVN 826 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



1612378 seqs, 512079187 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1612378 



Database 



UniProt__03:* 
1: uni prot_sprot :* 
2 : uni prot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



7o 



SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length DB 


ID 


Description 


1 


4287 


100.0 


826 


1 


HIFA_HUMAN 


Q16665 


homo sapien 


2 


4065.5 


94.8 


823 


2 


Q6IV47 


Q6i v47 


bos mutus g 


3 


4061.5 


94.7 


823 


1 


HIFA_BOVIN 


Q9xta5 


bos taurus 


4 


4017.5 


93.7 


821 


2 


Q64F54 


Q64f54 


spermophilu 


5 


3950 


92.1 


824 


2 


Q6H8T3 


Q6h8t3 


spa! ax juda 


6 


3941.5 


91.9 


819 


2 


Q7YSE5 


Q7yse5 


oryctolagus 


7 


3857.5 


90.0 


786 


2 


Q6SLL1 


Q6slll 


cam's famil 


8 


3803.5 


88.7 


825 


1 


HIFA_RAT 


035800 


rattus norv 


9 


3803 


88.7 


836 


1 


HIFA_MOUSE 


Q61221 


mus musculu 


10 


3404.5 


79.4 


811 


1 


HIFA_CHICK 


Q9yib9 


gall us gall 


11 


2925 


68.2 


802 


2 


Q6PI54 


Q6pi54 


xenopus lae 


12 


2532.5 


59.1 


805 


1 


HIFA_XENLA 


Q9i 8a9 


xenopus lae 


13 


2334.5 


54.5 


774 


2 


Q6STN7 


Q6stn7 


ctenopharyn 


14 


2331 


54.4 


777 


2 


Q6EHI4 


Q6ehi4 


brachydanio 


15 


2076 


48.4 


766 


1 


HIFA_ONCMY 


Q98sw2 


oncornynchu 


16 


1962 


45.8 


489 


2 


Q6EMI3 


Q6emi 3 


xenopus lae 


17 


1845 


43.0 


867 


2 


Q9W7C6 


Q9w7c6 


gall us gall 


18 


1831.5 


42.7 


870 


2 


Q9PTB3 


Q9ptb3 


coturnix co 


19 


1800.5 


42.0 


862 


2 


Q6GL61 


Q6gl61 


xenopus tro 


20 


1787 


41.7 


859 


2 


Q6GQ12 


Q6gql2 


xenopus lae 


21 


1784.5 


41.6 


862 


2 


Q6GP97 


Q6gp97 


xenopus lae 
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fundulus he 


22 


1770 


41. 3 


873 


2 


080GM4 


Q8qgm4 


23 


1765 . 5 


41.2 


870 


1 


PASl_HUMAN 


Q99814 


homo sapien 


24 


1763 


41. 1 


835 


2 


0696W2 


Q696w2 


ctenopharyn 


25 


1730 5 


40 4 


870 


2 


Q9XTA4 


Q9xta4 


bos taurus 


2fi 
t- \J 


1730 5 


40 4 


874 


1 


PASl MOUSE 


P97481 


mus musculu 


77 


1 778 5 


40 3 


874 


2 


Q6PEU2 


Q6peu2 


mus musculu 


28 


1713 5 


40.0 


533 


2 


Q6PI33 


Q6pi 33 


brachydani o 




1705 5 


39 8 


874 


1 


PASl RAT 


Q9jhsl 


rattus norv 


30 


1241 

-L L. *T X 


28 9 


626 


2 


06EGR9 


Q6eg r9 


brachydani o 


31 


1237 


28.9 


643 


2 


06STN6 


Q6stn6 


ctenopharyn 


32 


1171. 5 


27.3 


662 


2 


Q9JHS2 


Q9jhs2 


rattus norv 


33 


1166 \ 5 


27 .2 


632 


2 


Q8WXA1 


Q8wxal 


homo sapien 


34 


1166! 5 


27.2 


669 


2 


Q66K72 


Q66k72 


homo sapien 


35 


1165 5 


27.2 


667 


2 


Q9Y2N7 


Q9y2n7 


homo sapien 


36 


1161. 5 


27.1 


662 


2 


Q9Z2I5 


Q9z2i5 


mus musculu 


37 


1157 


270 


630 


2 


Q9QX54 


Q9qx54 


mus musculu 


38 


1156.5 


27.0 


632 


2 


Q96K34 


Q96k34 


homo sapien 


39 


1071 


25.0 


571 


2 


Q7T2E4 


Q7t2e4 


brachydani o 


40 


1004.5 


23.4 


1057 


2 


Q6DN44 


Q6dn44 


palaemonete 


41 


1002 


23.4 


189 


2 


Q6RYC8 


Q6ryc8 


ovis aries 


42 


1001.5 


23.4 


648 


2 


Q9HAI2 


Q9hai2 


homo sapien 


43 


989 


23.1 


189 


2 


Q6RYC9 


Q6ryc9 


sus scrofa 


44 


896 


20.9 


1507 


1 


SIMA_DR0ME 


Q24167 


drosophi la 


45 


835 


19.5 


353 


2 


Q7PR97 


Q7pr97 


anopheles g 



ALIGNMENTS 



RESULT 1 
HIFA__HUMAN 

ID HIFA_HUMAN STANDARD; PRT; 826 AA. 

AC Q16665; Q96PT9; Q9UP81; 

DT 01-NOV-1997 (Rel . 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Hypoxia-inducible factor 1 alpha (HIF-1 alpha) (HIFl alpha) (ARNT 

DE interacting protein) (Member of PAS protein 1) (MOPl) . 

GN Name=HIFlA; 

OS Homo sapiens (Human). 

oc Eukaryota; Metazoa; chordata; craniata; vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; catarrhini; Hominidae; Homo. 

OX NCBI_TaxlD=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 166-170; 259-289 AND 771-781. 

RX MEDLINE=95296340; PubMed=7539918 ; 

RA Wang G.L., Jiang B.-H., Rue E.A., Semenza G.L.; 

RT "Hypoxia-inducible factor 1 is a basic-helix-loop-helix-PAS 

RT heterodimer regulated by cellular 02 tension."; 

RL Proc. Natl. Acad. Sci . U.S.A. 92:5510-5514(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Hepatoma; 

RX MEDLINE=97236817; PubMed=9079689 ; DOI=10.1074/jbc.272.13.8581; 

RA Hogenesch J.B., Chan W.K., Jackiw V.H., Brown R.C, Gu Y.-Z., 

RA Pray-Grant M. , Perdew G.H., Bradfield C.A.; 

RT "characterization of a subset of the basic-helix-loop-helix-PAS 

RT superfamily that interacts with components of the dioxin signaling 

RT pathway."; 

RL 3. Biol. chem. 272:8581-8593(1997). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RX MEDLINE=99000835; PubMed=9782081; DOI-10. 1006/geno. 1998. 5416; 
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RA Iyer N.v. f Leung S.W., Semenza G.L.; 

RT "The human hypoxia-inducible factor lalpha gene: HIFlA structure and 

RT evolutionary conservation."; 

RL Genomics 52:159-165(1998). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Rupert J.L., Hochachka P.W. ; 

RT "HlFla sequence in the Quechua, a high altitude population."; 

RL submitted (N0V-1999) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TlssuE=Glial tumor; 

RA sun B. f Zhao H.R., YU R.T., Ni M.S.H.; 

RL Submitted (SEP-2000) to the EMBL/GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TlSSUE=Liver; 

RA Tanaka s., Sugimachi K.; 

RT "Hypoxia-inducible factor-1 alpha variant isolated from human liver 

RT tissue."; 

RL submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TlSSUE=Choriocarcinoma f and Placenta; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10. 1073/pnas . 242603899; 

RA strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M. f Schuler G.D., 

RA Altschul S.F., Zeeberg B. f Buetow K.H., Schaefer C.F. f Bhat N . K. , 

RA Hopkins R.F., Jordan H. f Moore T. , Max S.I., Wang J . , Hsieh F. f 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki s., carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards s., worley K.C, Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey 3 . , Helton E., Ketteman M. , Madan A., Rodrigues s. f Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.w. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood 3 . , Schmutz 3 . , Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [8] 

RP TRANSACTIVATION DOMAINS NTAD AND CTAD. 

RX MEDLINE=97382249; PubMed=9235919 ; DOI-10. 1074/jbc. 272. 31.19253; 

RA Jiang B.H., Zheng J.Z., Leung S.W., Roe R. , semenza G.L.; 

RT "Transacti vation and inhibitory domains of hypoxia-inducible factor 

RT lalpha. Modulation of transcriptional activity by oxygen tension."; 

RL J. Biol. chem. 272:19253-19260(1997). 

RN [9] 

RP SUBCELLULAR LOCATION, AND MUTAGENESIS OF LYS-719. 

RX MEDLINE=99043864; PubMed=9822602 ; DOI=10.1093/emboj/17.22.6573; 

RA Kallio P.J., okamoto K., O'Brien s., Carrero P., Makino Y., Tanaka H., 

RA Poel linger L. ; 

RT "Signal transduction in hypoxic cells: inducible nuclear translocation 

RT and recruitment of the CBP/p300 coactivator by the hypoxia-inducible 

RT factor-lalpha. " ; 

RL EMBO J. 17:6573-6586(1998). 

RN [10] 

RP OXYGEN-DEPENDENT DEGRADATION DOMAIN. 

Page 3 



us-09-922-958-4. rup 

RX MEDLINE=98318598; PubMed=9653127 ; DOI=10.1073/pnas.95.14.7987; 

RA Huang L.E., Gu 3., Schau M. , Bunn H.F.; 

RT "Regulation of hypoxia-inducible factor lalpha is mediated by an 02- 

RT dependent degradation domain via the ubiqui tin-proteasome pathway."; 

RL Proc. Natl. Acad. Sci . U.S.A. 95:7987-7992(1998). 

RN [11] 

RP TRANSACTIVATION DOMAINS NTAD AND CTAD, INTERACTION WITH APEX, AND 

RP MUTAGENESIS OF CYS-800. 

RX MEDLINE=99219869; PubMed=10202154; DOI=10. 1093/emboj/18 . 7. 1905 ; 

RA Ema M., Hi rota K., Mimura 3., Abe H., Yodoi J., Sogawa K. , 

RA Poellinger L., Fujii-Kuriyama Y. ; 

RT "Molecular mechanisms of transcription activation by HLF and HlFlalpha 

RT in response to hypoxia: their stabilization and redox signal -induced 

RT interaction with CBP/p300."; 

RL EMBO 3 . 18:1905-1914(1999). 

RN [12] 

RP INTERACTION WITH NCOAl; NCOA2 AND APEX. 

RX MEDLINE=20063199; PubMed=10594042 ; 

RA Carrero P., okamoto K. , Coumailleau P., O'Brien s., Tanaka H., 

RA Poel li nger L. ; 

RT "Redox-regulated recruitment of the transcriptional coactivators CREB- 

RT binding protein and SRC-1 to hypoxia-inducible factor lalpha."; 

RL Mol. Cell. Biol. 20:402-415(2000). 

RN [13] 

RP MUTAGENESIS OF SER-551 AND THR-552. 

RX MEDLINE=20243767; PubMed=10758161; DOI=10. 1073/pnas .080072497; 

RA Sutter C.H., Laughner E., semenza G.L.; 

RT "Hypoxia-inducible factor lalpha protein expression is controlled by 

RT oxygen-regulated ubiqui tination that is disrupted by deletions and 

RT missense mutations."; 

RL Proc. Natl. Acad. Sci. U.S.A. 97:4748-4753(2000). 

RN [14] 

RP UBIQUITINATION. 

RX MEDLINE=21214630; PubMed=11292861; DOI=10.1126/science. 1059796; 

RA Jaakkola P., Mole D.R., Tian Y.-M., Wilson M.I., Gielbert 3 . , 

RA Gaskell S.J., von Kriegsheim A., Hebestreit H.F., Mukherji M. , 

RA Schofield C.J., Maxwell P.H., Pugh C.W., Ratcliffe P. J.; 

RT "Targeting of HlF-alpha to the von Hi ppel -Li ndau ubiqui tylation 

RT complex by 02-regulated prolyl hydroxy! ati on. " ; 

RL Science 292:468-472(2001). 

RN [15] 

RP S-NITROSYLATION . 

RX MEDLINE=22448624; PubMed=12560087; DOI=10.1016/s0014-5793(02)03887-5; 

RA Sumbayev V.V. , Budde A., Zhou 3 . , Bruene B.; 

RT "HIF-l alpha protein as a target for s-ni trosation. "; 

RL FEBS Lett. 535:106-112(2003). 

RN [16] 

RP ACETYLATION OF LYS-532. 

RX MEDLINE=22351901; PubMed=12 464182 ; DOI=10. 1016/S0092-8674(02)0108S-1; 

RA Jeong J.~W., Bae M.-K., Ahn M.-Y., Kim S.-H., Sohn T.-K., Bae M.-H., 

RA Yoo M.-A., Song E.-J., Lee K.-J., Kim K.-w.; 

RT "Regulation and destabilization of HIF-lalpha by ARDl-medi ated 

RT acetyl ati on. " ; 

RL Cell 111:709-720(2002). 

RN [17] 

RP HYDROXYLATION OF ASN-803. 

RX MEDLINE=22074910; PubMed=12080085 ; DOI=10. 110 1/gad. 991402; 

RA Lando D. ( Peet D.J., Gorman 3.3., Whelan D.A. , Whitelaw M.L., 

RA Bruick R.K. ; 

RT "FIH-1 is an asparaginyl hydroxylase enzyme that regulates the 

RT transcriptional activity of hypoxia-inducible factor."; 

RL Genes Dev. 16:1466-1471(2002). 

RN [18] 
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RP HYDROXYLATION OF PRO-402 AND PRO- 564. 

RX MEDLINE=21558830; PubMed=11598268; DOI=10.1126/science. 1066373; 

RA Bruick R.K., McKnight S.L.; 

RT "A conserved family of prolyl -4-hydroxyl ases that modify HIF. " ; 

RL Science 294:1337-1340(2001). 

RN [19] 

RP REVIEW. 

RX MEDLINE=20407247; PubMed=10950862 ; 

RA Semenza G. L. ; 

RT "HIF-1 and human disease: one highly involved factor."; 

RL Genes Dev. 14:1983-1991(2000). 

RN [20] 

RP 3D-STRUCTURE MODELING. 

RX MEDLINE=20539371; PubMed=11089639 ; 

RA Michel G. , Mi net E. , Ernest I., Roland I., Durant F. , Remacle 3 . , 

RA Michiels c. ; 

RT "A model for the complex between the hypoxia-inducible factor-1 (HIF- 

RT 1) and its consensus DNA sequence."; 

RL 3 . Biomol. struct. Dyn. 18:169-179(2000). 

RN [21] 

RP X-RAY CRYSTALLOGRAPHY (2.15 ANGSTROMS) OF 775-826 IN COMPLEX WITH 

RP HIF1AN. 

RX MEDLINE=22412289; PubMed=12 446723 ; DOI=10.1074/ibc.C200644200; 

RA Elkins D.M., Hewitson K.S., McNeill L.A., Seibel J.F., 

RA schlemminger I., Pugh C.W. , Ratcliffe P. J., schofield C.J.; 

RT "structure of factor-inhibiting hypoxia-inducible factor (HIF) reveals 

RT mechanism of oxidative modification of HIF-1 alpha."; 

RL 3 . Biol. Chem. 278:1802-1806(2003). 

RN [22] 

RP STRUCTURE BY NMR OF 786-826 IN COMPLEX WITH 302-418 OF EP300. 

RX MEDLINE=21957254; PubMed=11959990 ; DOI=10. 1073/pnas. 082117899; 

RA Freedman S.J., Sun Z.-Y.J., Poy F., Kung A.L., Livingston D.M., 

RA Wagner G. , Eck M.J.; 

RT "Structural basis for recruitment of CBP/p300 by hypoxia-inducible 

RT factor-1 alpha."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:5367-5372(2002). 

RN [23] 

RP STRUCTURE BY NMR OF 776-826 IN COMPLEX WITH 345-439 OF CREBBP. 

RX MEDLINE=21957241; PubMed=11959977 ; DOI=10. 1073/pnas. 082121399; 

RA Dames S.A., Marti nez-Yamout M., De Guzman R.N., Dyson H.J., 

RA Wright P.E. ; 

RT "structural basis for Hif-1 alpha /CBP recognition in the cellular 

Query Match 100.0%; Score 4287; DB 1; Length 826; 

Best Local Similarity 100.0%; Pred. No. 7.5e-208; 

Matches 826 ; Conservati ve 0 ; Mi smatches 0 ; Indel s 0 ; Gaps 0 ; 
Qy 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 ! 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

Qy 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I 1 I I I II II I M I II I I I I I I I I I 
Db 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

Qy 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

I II I I II I 1 1 II 1 1 I I 1 1 1 1 1 1 I MM I 1 1 I I 1 1 1 1 I I 1 1 I IN 1 1 1 1 I II I 1 1 1 1 I I 1 1 

Db 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

Qy 181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 

I II I I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 II 1 1 I II 1 1 1 M M I III 1 1 1 1 1 1 1 1 1 

Db 181 TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 240 
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Qy 


241 


TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 




Db 


241 


TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 


300 


Qy 


301 


TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 


360 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

(lllllllillllllllllllltillllllllllllillfllllllllllllllllllil 




Db 


301 


TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 


360 


Qy 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 


Qy 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i I i i i i i i t i i i i i i i i i i i i i i i i i 1 i i i i i i i i 




Db 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 


Qy 


481 

~ \J J- 


NPESLFLSFTMPOIODOTPSPSDGSTROSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 I 1 1 1 1 1 ( 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 




Db 


481 


NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 


Qy 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 


600 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
i i i i i i i i i i i i i i i i 1 i i i i i i i i i i i i i t i i i i i i i i i i i i i i i 1 i i i i i i ■ i i i i i i 




Db 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQL5 PLESSSAS PESAS PQSTVT 


600 


Qy 


601 


VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 


660 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


VFQQTQIQEPTANA 1 1 1 1 ATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 


660 


Qy 


661 


DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i 




Db 


661 


DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 


Qy 


721 


KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 


780 




1 1 1 1 1 I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1! 1 1 1 1 1 1 ! 1 1 1 1 1 I 1 II Ml II 1 II 




Db 


721 


KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 


780 


Qy 


781 


RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ll 1 II 1 1 1 il 1 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



July 27, 2005, 07:32:56 ; Search time 14.1136 Seconds 

(without alignments ) 
1479.779 Million cell updates/sec 

US-09-922-958-5 
275 

1 KLELVEKL FAEDTEAKN P FS I PMDDDFQLRSFDQLSPLES 54 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2105692 seqs, 386760381 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : 



2105692 



A_Geneseq_16Dec04 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8 : geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
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Length 


DB 


ID 


Description 
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ALIGNMENTS 



RESULT 1 
AAO23490 

ID AAO23490 standard; peptide; 54 AA. 
XX 

AC AAO23490; 
XX 

DT 12-FEB-2004 (first entry) 
XX 

DE Murine HIF-lalpha protein N-TAD region fragment (residues 531-584) . 
XX 

KW HIF-lalpha; hypoxia-inducible factor -lalpha; vasotropic; antidiabetic; 

KW ophthalmological ; antiinflammatory; cardiant; antirheumatic; mouse; 

KW antiarthritic; cerebroprotective ; angiogenesis stimulator; gene therapy; 

KW N-TAD. 

XX 

OS Mus sp. 



XX 

PN WO2003074560-A2 . 
XX 

PD 12-SEP-2003. 
XX 

PF 05-MAR-2003; 2003WO-SE000372 . 
XX 

PR 05-MAR-2002; 2002US-0361333P . 

PR 27-NOV-2002; 2002US-0429307P . 
XX 

PA (ANGI-) ANGIOGENETICS SWEDEN AB . 
XX 

PI Pereira T, Poellinger L, Hellstroem M; 
XX 

DR WPI;. 2003-712876/67. 
XX 

PT New hypoxia-inducible factor (HIF)-lalpha protein, useful for increasing 

PT angiogenesis, or treating a condition associated with HIF-lalpha 

PT underexpression in a cell, a group of cells, or an organism, e.g. 

PT ischemia or inflammation. 

XX 

PS Example 4;. Fig 4; 96pp; English. 
XX 

CC The invention relates to a hypoxia-inducible factor (HIF)-lalpha protein 

CC that has (a) an altered transactivation capacity and improved stability 

CC at normoxia. The HIF-lalpha protein, polynucleotide, vector, and 

CC pharmaceutical composition are useful for increasing angiogenesis , 

CC interfering with a normal response to reoxygenation following hypoxia, or 

CC treating a condition associated with HIF-lalpha underexpression in a 

CC cell, a group of cells, or an organism, e.g. ischaemia, diabetic 

CC retinopathy, inflammation, coronary heart disease, rheumatoid arthritis, 

CC stroke. The proteins and pharmaceutical compositions are also useful for 

CC mimicking the hypoxic response or artificially inducing a hypoxic 

CC response in a cell, group of cells, or organism, inducing vascular 

CC formation or vascular development in a cell or a group of cells , 

CC increasing angiogenetic activity in a cell, or influencing erythropoietin 

CC production, metabolism, or an inflammatory response in a cell, a group of 

CC cells, or an organism. The present sequence represents a N-TAD region of 

CC a murine HIF-1 alpha protein 

XX 

SQ Sequence 54 AA; 

Query Match 100.0%; Score 275; DB 7; Length 54; 
Best Local Similarity 100.0%; Pred. No. l.le-28; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 



RESULT 2 
AAY94632 

ID AAY94632 standard; protein; 116 AA. 
XX 

AC AAY94 632; 
XX 



DT 15-AUG-2000 (first entry) 
XX 

DE HIF-lalpha variant protein sequence HIF-lalpha/52 6-64 1 . 
XX 

KW Hypoxia-inducible factor lalpha; HIF-lalpha; PAS-B; N-TAD; C-TAD; - 

KW regulation; angiogenesis ; erythropoiesis ; glycolysis; human. 

XX 

OS Homo sapiens. 
XX 

PN WO200029437-A1. 
XX 

PD 25-MAY-2000. 
XX 

PF ll-NOV-1999; 99WO-SE002053 . 
XX 

PR 13-NOV-1998; 98SE-00003891 . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN AB. 
XX 

PI Berkenstam A, Poellinger L; 
XX • 

DR WPI; 2000-399715/34. 
XX 

PT Human hypoxia-inducible factor alpha variants for identifying compounds 

PT that modulate its functional domain and regulate genes involved in 

PT angiogenesis, erythropoiesis. 
XX 

PS Claim 13; Page 76-77; 87pp; English. 
XX 

CC This sequence represents a fragment of the hypoxia-inducible factor (HIF) 

CC -lalpha amino acid sequence. The mechanism of action of HIF-lalpha is a 

CC multi-step process which includes hypoxia-dependent nuclear import and 

CC activation of the transactivation domain. The HIF-lalpha consists of a 

CC number of functional domains including a PAS-B (Per, Arnt, Sim) domain 

CC located in human HIF-lalpha between amino acids 173 and 390, a C-terminal 

CC nuclear localization sequence located at amino acids 718-584, a 

CC transactivator domain (N-TAD) located between amino acids 531 and 584, 

CC and a second transactivator domain (C-TAD) located between 813 and 826. 

CC The invention relates to isolated variants of HIF-lalpha, such as that 

CC represented by the present sequence. The variants are useful for 

CC identifying compounds capable of modulating the function of a functional 

CC domain of human HIF-lalpha. The method comprises contacting a candidate 

CC compound with a cell expressing a HIF-lalpha variant conjugated to a 

CC molecular probe. The localization of the probe can be detected in the 

CC cell. The Aequeora victoria green fluorescent protein can be used as the 

CC molecular probe. The compounds are useful for the regulation of HIF- 

CC lalpha target genes, such as those involved in the regulation of 

CC angiogenesis, erythropoiesis an glycolysis 
XX 

SQ Sequence 116 AA; 

Query Match 100.0%; Score 275; DB 3; Length 116; 
Best Local Similarity 100.0%; Pred. No. 2.8e-28; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 
I I II ! I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I 



KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 60 



us-09-922-958-5.rapb 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



July 27, 2005, 07:53:38 



US-09-922-958-5 
275 

1 KLELVEKLFAEDTEAKNPFS. 



Search time 13.3159 Seconds 
(without alignments) 
1577.480 Million cell updates/sec 



, IPMDDDFQLRSFDQLSPLES 54 



8LOSUM62 
Gapop 10.0 



Gapext 0.5 



1741741 seqs, 388992284 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1741741 



Post-processing : 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Publ i shed^Appl i cati ons_AA : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 



/cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB . pep : 
/cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB . pep : 
/cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB . pep : 
/cgn2_6/ptodata/l/pubpaa/USl0A^PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0B_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0c„PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0D_PUBCOMB . pep 
/cgn2__6/ptodata/l/pubpaa/USl0E_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USl0_NEW„PUB . pep : 
/cgn2_6/ptodata/l/pubpaa/USllA_PUBCOMB . pep 
/cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB . pep : 
/cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB . pep : 
/cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB . pep : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 
Sequence 5, Appli 
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us-09-922-958-5.rapb 



2 


275 


100 


o 


409 


15 


US-10-425-833-8 


3 


275 


100 


o 


466 


15 


US-10-425-833-9 


4 


275 


100 


0 


538 


15 


US-10-425-833-6 


5 


275 


100 


0 


542 


15 


US-10-264-049-2606 


6 


275 


100 


0 


595 


15 


US-10-425-833-7 


7 


275 


100 


o 


735 


16 


US-10-032-361-2 


8 


275 


100 


o 


826 


9 


US-09-922-958-4 


9 


275 


100 


o 


826 


9 


US-09-833-790-235 


10 


275 


100 


o 


826 


9 


US-09-736-457-330 


11 


275 


100 


o 


826 


9 


US-09-902-941-330 


12 


275 


100 


o 


826 


9 


US-09-849-626-330 


13 


275 


100 


o 


826 


10 


US-09-967-388-4 


14 


275 


100 


o 


826 


10 


US-09-476-300-330 


15 


275 


100 


o 


826 


13 


US- 10-02 8-15 8-2 3 


16 


275 


100 


o 


826 


13 


US- 10-101-812 -10 

v ■>»> vy ^ \y i/^c. JLvy 


17 


275 


100 


o 


826 


14 


US-10-101-662A-9 


18 


275 


100 


o 


826 


14 


US-10-017-754-330 


19 


275 


100 


o 


826 


14 


US-10-115-987B-14 


20 


275 


100 


o 


826 


14 


US-10-287-670-9 


21 


275 


100 


o 


826 


14 


US-10-113-872-330 


22 


275 


100 


o 


826 


14 


US-10-423-419-2 


23 


275 


100 


o 


826 


15 


US-10-283-017-330 


24 


275 


100 


o 


826 


16 


US-10-032-361-1 


25 


275 


100 


o 


826 


16 


US- 10-8 5 9-93 5-10 


26 


275 


100 


o 


826 


16 


US-10-859-935-23 


27 


275 


100 


o 


826 


17 


US- 10-901- 583-18 


28 


275 


100 


o 


826 


18 


US-10-472-595A-35 


29 


275 


100 


o 


826 


20 


US-11-043-493-23 


30 


275 


100 


o 


827 


10 


US-09-919-039-149 


31 


275 


100 


o 


827 


14 


US-10-2 47-671- 137 


32 


275 


100 


o 


836 


18 


US-10-988-486-6 


33 


270 


98 


2 


823 


14 


US-10-205-342-13 


34 


270 


98 


2 


826 


14 


US-10-101-816-5 


35 


270 


98 


2 


826 


16 


US-10-854-483-5 


36 


267 


97 


1 


826 


14 


US-10-101-816-2 


37 


267 


97 


1 


826 


16 


US-10-854-483-2 


38 


264 


96 


0 


632 


15 


US-10-425-833-10 


39 


262 


95 


3 


826 


14 


US-10-101-816-6 


40 


262 


95 


3 


826 


14 


US-10-101-816-7 


41 


262 


95 


3 


826 


16 


US-10-854-483-6 


42 


262 


95 


3 


826 


16 


US-10-854-483-7 


43 


180 


65 


5 


34 


17 


US-10-901-583-9 


44 


180 


65 


5 


34 


18 


US-10-472-595A-16 


45 


180 


65 


5 


34 


18 
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Sequence 8, Appli 
Sequence 9, Appli 
Sequence 6 f Appli 
Sequence 2606, Ap 
Sequence 7, Appli 
Sequence 2, Appli 
Sequence 4, Appli 
Sequence 235, App 
Sequence 330, App 
Sequence 330, App 
Sequence 330, App 
Sequence 4, Appli 
Sequence 330, App 
Sequence 23, Appl 
Sequence 10, Appl 
Sequence 9, Appli 
Sequence 330, App 
Sequence 14, Appl 
Sequence 9, Appli 
Sequence 330, App 
Sequence 2, Appli 
Sequence 330, App 
Sequence 1, Appli 
Sequence 10, Appl 
Sequence 23, Appl 
Sequence 18, Appl 
Sequence 35, Appl 
Sequence 23, Appl 
Sequence 149, App 
Sequence 137, App 
Sequence 6, Appli 
Sequence 13, Appl 
Sequence 5 , App li 
Sequence 5 , Appli 
Sequence 2, Appli 
Sequence 2, Appli 
Sequence 10, Appl 
Sequence 6, Appli 
Sequence 7, Appli 
Sequence 6, Appli 
Sequence 7, Appli 
Sequence 9, Appli 
Sequence 16, Appl 
Sequence 28, Appl 



ALIGNMENTS 



RESULT 1 
US-09-922-958-5 

; sequence 5, Application US/09922958 

; Patent No. US20020048794A1 

; GENERAL INFORMATION: 

; APPLICANT: POELLINGER, Lorenz 

; APPLICANT: PEREIRA, Teresa 

; APPLICANT: ruas, Jorge 

; TITLE OF INVENTION: MECHANISM OF CONDITIONAL REGULATION OF THE HYPOXIA-INDUCIBLE 
FACTOR- 1 BY 

; TITLE OF INVENTION: THE VON HIPPEL-LINDAU TUMOR SUPPRESSOR PROTEIN 

; FILE REFERENCE: 3743/49008 

; CURRENT APPLICATION NUMBER: US/09/922 , 958 

; CURRENT FILING DATE: 2001-08-07 



Page 2 



us-09-922-958-5.rapb 
; PRIOR APPLICATION NUMBER: US 60/223,480 
; PRIOR FILING DATE: 2000-08-07 
; NUMBER OF SEQ ID NOS: 7 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 5 
; LENGTH: 54 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-922-958-5 

Query Match 100.0%; Score 275; DB 9; Length 54; 

Best Local Similarity 100.0%; Pred. No. 4e-27; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

I I 1 1 Ml I II 1 1 I I 1 1 I I I I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II I 1 1 il I ! 1 1 I I 1 1 

Db. 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 



RESULT 2 
US-10-425-833-8 

; Sequence 8, Application US/10425833 

; Publication No. US20040018606A1 

; GENERAL INFORMATION: 

; APPLICANT: Bohl , Del phi ne 

; APPLICANT: Heard, Jean Michael 

; TITLE OF INVENTION: Control of protein systemic delivery of hypoxia using a 
tet-HIFl-alphja 

; TITLE OF INVENTION: chimeric transacti vator 
; FILE REFERENCE: 235748US0 

; CURRENT APPLICATION NUMBER: US/10/42 5 , 833 

; CURRENT FILING DATE: 2003-04-30 

; PRIOR APPLICATION NUMBER: US 60/376,269 

; PRIOR FILING DATE: 2002-04-30 

; NUMBER OF SEQ ID NOS: 19 

; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
; LENGTH: 409 
; TYPE: PRT 

; ORGANISM: ARTIFICIAL SEQUENCE 
; FEATURE: 

; OTHER INFORMATION: SYNTHETIC PEPTIDE 
US-10-425-833-8 

Query Match 100.0%; Score 275; DB 15; Length 409; 

Best Local Similarity 100.0%; Pred. No. 4.4e-26; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

I I I I II I M II I I I I I I I I I I I I I I I I I I I I I I II I I I III I I I I I I M I I I I 

Db 211 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 264 



RESULT 3 
US-10-425-833-9 

; Sequence 9, Application US/10425833 

; Publication No. US20040018606A1 

; GENERAL INFORMATION: 

; APPLICANT: Bohl, Del phi ne 

; APPLICANT: Heard, Jean Michael 

; TITLE OF INVENTION: Control of protein systemic delivery of hypoxia using a 
tet-HIFl-alphja 

; TITLE OF INVENTION: chimeric transacti vator 



Page 3 



us-09-922-958-5.rapb 

; FILE REFERENCE: 235748US0 

; CURRENT APPLICATION NUMBER: US/10/42 5 , 833 

; CURRENT FILING DATE: 2003-04-30 

; PRIOR APPLICATION NUMBER: US 60/376,269 

; PRIOR FILING DATE: 2002-04-30 

; NUMBER OF SEQ ID NOS: 19 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 9 

; LENGTH: 466 

; TYPE: PRT 

; ORGANISM: ARTIFICIAL SEQUENCE 
; FEATURE: 

; OTHER INFORMATION: SYNTHETIC PEPTIDE 
US-10-425-833-9 

Query Match 100.0%; Score 275; DB 15; Length 466; 

Best Local similarity 100.0%; Pred. No. 5.1e-26; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

1 1 I I 1 1 1 1 1 1 I I! I 1 1 II I I 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 II 

Db 211 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 264 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 
Perfect score: 275 



July 27, 2005, 07:42:11 ; Search time 3.19091 Seconds 

(without alignments) 
1628.284 Million cell updates/sec 

US-09-922-958-5 



Sequence : 



1 KLELVEKLFAEDTEAKNPFS I PMDDDFQLRSFDQLSPLES 54 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283416 



Database 



PIR_79: * 
pirl : 
pir2 : 1 
pir3 : '• 
pir4 : 1 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
JC4837 

hypoxia -inducible factor 1 alpha - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 15-Aug-1996 #sequence_revision 15-Oct-1996 #text_change 09-Jul-2004 
C; Accession : JC4 837 

R;Wenger, R.H.; Rolfs, A.; Marti, H.H.; Guenet, J.L.; Gassmann, M. 
Biochem. Biophys . Res. Commun. 223, 54-59, 1996 

A; Title: Nucleotide sequence, chromosomal assignment and mRNA expression of 
mouse hypoxia- inducible f actor-1 alpha . 

A; Reference number: JC4837; MUID: 96254028; PMID:8660378 
A; Access ion: JC4 837 
A; Molecule type: mRNA 
A; Residues: 1-810 <WEN> 

A;Cross-references : UNIPROT : Q6122 1 ; EMBL : X95580 ; NID: gl430864 ; PIDN : CAA64 833 . 1 ; 
PID:g4379202 

C; Comment : This factor is involved in the oxygen- regulated transcription of 

several genes including erythropoietin. 

C; Genetics : 

A; Gene : Hif lalpha 



A; Map position: 12 

C; Keywords: transcription factor 

F; 5-5 8 /Region : helix-loop-helix #status predicted 

Query Match 100.0%; Score 275; DB 2; Length 810; 

Best Local Similarity 100.0%; Pred. No. 2.2e-25; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYI PMDDDFQLRSFDQLSPLES 54 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 519 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 572 



RESULT 2 
138972 

hypoxia-inducible factor 1 alpha - human 
N;Alternate names: ARNT interacting protein 
C; Species: Homo sapiens (man) 

C;Date: 23-Feb-1996 #sequence_revision 23-Feb-1996 #text_change 09-Jul-2004 
C;Accession: 138972; G01875 

R;Wang, G.L.; Jiang, B.H.; Rue, E.A.; Semenza, G.L. 
Proc. Natl. Acad. Sci. U.S.A. 92, 5510-5514, 1995 

A; Title: Hypoxia-inducible factor 1 is a basic-helix-loop-helix-PAS heterodimer 
regulated by cellular 02 tension. 

A; Reference number: 138972; MUID: 95296340; PMID:7539918 
A; Access ion: 138972 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-826 <RES> 

A; Cross-references : UNIPROT : Q16665 ; EMBL:U22431; NID:g881345; PIDN : AAC50152 . 1 ; 
PID:g881346 

A;Note: parts of this sequence were confirmed by peptide sequencing 
R;Hogenesch, J.B.; Chan, W.K.; Carver, L.A.; Bradfield, C.A. 
submitted to the EMBL Data Library, June 1995 
A; Reference number: H00692 
A; Accession: G0187 5 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-826 <HOG> 

A; Cross-references : EMBL:U29165; NID : gll44012 ; PIDN : AAC51210 . 1-; PID:gll44013 

C; Genetics: 

A; Gene: GDB : HIF1A 

A; Cross-references : GDB: 512229 

A; Map position: 14q21-14q24 

C; Keywords: heterodimer 

Query Match 100.0%; Score 275; DB 2; Length 826; 

Best Local Similarity 100.0%; Pred. No. 2.3e-25; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYI PMDDDFQLRSFDQLSPLES 54 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I r 
Db 532 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYI PMDDDFQLRSFDQLSPLES 585 



RESULT 3 
JC5809 



hypoxia-inducible factor 1 alpha - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 04-Feb-1998 #sequence_revision 13-Mar-1998 #text_change 26-Aug-1999 
C; Accession: JC5809 
R;Ladoux, A.; Frelin, C. 

Biochem. Biophys . Res. Commun. 240, 552-556, 1997 

A;Title: Cardiac expressions of HIF-1 alpha and HLF/EPAS, two basic loop 
helix/PAS domain transcription factors involved in adaptative responses to 
hypoxic stresses, 

A; Reference number: JC5809; MUID : 98063274 ; PMID: 9398602 
A; Accession: JC5809 

A; Status: nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-813 <LAD> 

C; Comment: This protein associates to the aryl hydrocarbon receptor nuclear 
translocator , binds to specific hypoxia responsive elements and activates the 
transcription of hypoxia sensitive genes such as erythropoietin, vascular 
endothelial growth factor, the GLUT1 glucose transporter, enzymes of gycolysis 
and the inducible form of nitric oxyde synthase. 
F; 6-144/Region : basic helix-loop-helix #status predicted 

Query Match 98.2%; Score 270; DB 2; Length 813; 

Best Local Similarity 98.1%; Pred. No. 9e-25; 

Matches 53 ; Conservative 0 ; Mismatches 1 ; Indels 0 ; Gaps 0 ; 

Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I II I 
Db 519 KLELVEKLFAEDTEAKNPFSAQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 572 



us-09-922-958-5.rup 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: July 27, 2005, 07:33:41 ; Search time 13.9295 Seconds 

(without alignments) 
1985.153 Million cell updates/sec 

Title: US-09-922-958-5 
Perfect score: 275 

Sequence: 1 KLELVEKLFAEDTEAKNPFS IPMDDDFQLRSFDQLSPLES 54 

scoring table: 8LOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1612378 seqs, 512079187 residues 

Total number of hits satisfying chosen parameters: 1612378 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Uni Prot_03 : * 

1: uni prot_sprot : * 
2 : uni prot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
Q6SLL1 

ID Q6SLL1 PRELIMINARY; PRT; 786 AA. 

AC Q6SLL1; 

DT 05-JUL-2004 (TrEMBLrel . 27, Created) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update) 

DE Hypoxia-inducible factor 1 alpha subunit (Fragment). 

OS cam's familiaris (Dog). 

oc Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

oc Mammalia; Eutheria; Carnivora; Fissipedia; canidae; Cam's, 

ox NCBl_TaxlD=9615; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TissuE=Liver; 

RA Spee B., Penning L.C., Rothuizen J.; 

RL Submitted (OCT-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL ; AY455802; AAR19225.1; 

DR GO; GO:0005634; C:nucleus; IEA. 

DR GO; GO:0004871; Fisignal transducer activity; IEA. 

DR GO; GO:0003700; F: transcri ption factor activity; IEA. 

DR GO; GO:0006355; P: regulation of transcription, DNA-dependent ; IEA. 

DR GO; GO:0007165; P:signal transduction; IEA. 

DR InterPro; IPR001092; HLH_basic. 

DR InterPro; IPR001321; HypoxindFlA. 

DR InterPro; IPR001610; PAC 

DR InterPro; IPR000014; PAS. 

DR Pfam; PF00785; PAC; 1. 

DR Pfam; PF00989; PAS; 2. 

DR PRINTS; PR01080; HYPOXIAIFlA. 

DR SMART; SM00353; HLH; 1. 

DR SMART; SM00086; PAC; 1. 

DR SMART; SM00091; PAS; 2. 

DR PROSITE; PS50112; PAS; 2. 
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FT NON_TER 
FT NON„TER 
SQ SEQUENCE 



us-09-922-958-5. rup 

1 1 
786 786 

786 AA; 88015 MW; C37A27C25C343CDC CRC64; 



Query Match • 100.0%; Score 275; DB 2; Length 786; 

Best Local similarity 100.0%; Pred. No. 2.7e-24; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; 



Qy 

Db 



Gaps 
54 



0; 



1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 

I 1 1 I II I 1 1 II ! I 1 1 1 1 I I I I II I 1 1 1 1 1 1 I II II I 1 1 II 1 1 1 1 I I I I I I I 1 1 I 

512 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 565 



RESULT 2 
Q7YSE5 

ID Q7YSE5 PRELIMINARY; PRT; 819 AA. 

AC Q7YSE5; 

DT 01-OCT-2003 (TrEMBLrel . 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE Hypoxia inducible factor 1 alpha subunit. 

OS Oryctolagus cuniculus (Rabbit; . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

oc Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Clausen I., Kietz s., Fischer B.; 

RL submitted (APR-2003) to the EMBL/GenBank/DDBJ databases. 

cc -!- SIMILARITY: Contains 1 basic helix-loop-helix (bHLH) domain. 

DR EMBL ; AY273790; AAP43517.1; -. 

DR HSSP; Q16665; 1H2K. 

DR GO; GO:0004871; F:signal transducer activity; IEA. 

DR GO; GO:0006355; P: regulation of transcription, DNA-dependent ; IEA. 

DR GO; GO:0007165; P:signal transduction; IEA. 

DR interPro; IPR001092; HLH_basic. 

DR InterPro; IPR001610; PAC. 

DR InterPro; IPR000014; PAS. 

DR Pfam; PF00010; HLH ; 1. 

DR Pfam; PF00785; PAC; 1. 

DR Pfam; PF00989; PAS; 2. 

DR SMART ; SM00353; HLH; 1. 

DR SMART ; SM00086; PAC; 1. 

DR SMART; SM00091; PAS; 2. 

DR PROSITE; PS50888; HLH; 1. 

DR PROSITE; PS50112; PAS; 2. 

SQ SEQUENCE 819 AA; 91284 MW; E11B4FBF7D4F6C7C CRC64; 

Query Match 100.0%; Score 275; DB 2; Length 819; 

Best Local Similarity 100.0%; Pred. No. 2.8e-24; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

I I I 1 1 I I 1 1 II I I 1 1 1 1 I I I I Ml 1 1 1 1 1 1 I II II I 1 1 1 1 1 1 1 1 I Ml II I 1 1 I 

Db 532 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 585 
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us-09-922-958-4.rai 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: July 27, 2005, 07:43:11 ; Search time 56.3182 Seconds 

(without alignments) 
1094.853 Million cell updates/sec 

Title: US-09-922-958-4 
Perfect score: 4287 

Sequence: 1 MEGAGGANDKKKISSERRKE QGSRNLLQGEELLRALDQVN 826 

scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 513545 seqs, 74649064 residues 

Total number of hits satisfying chosen parameters: 513545 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : lssued_Patents^AA: * 



/cgn2_6/ptodata/l/i aa/5A_COMB . pep : * 
/cgn2_6/ptodata/l/i aa/5B_COMB . pep: * 
/cgn2_6/ptodata/l/i aa/6A_COMB . pep : * 
/cgn2_6/ptodata/l/i aa/6B_COMB . pep : * 
/cgn2_6/ptodata/l/i aa/PCTUS_COMB . pep : * 
/cgn2_6/ptodata/l/i aa/backf i 1 esl. pep: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

:ch L 



No. Score Match Length DB ID Description 

1 4287 100.0 826 1 US-08-785-241-6 Sequence 6, Appli 

2 4287 100.0 826 2 US-08-480-473B-2 Sequence 2, Appli 

3 4287 100.0 826 3 US-08-915-213-2 Sequence 2, Appli 

4 4287 100.0 826 3 US-09-148-547-2 Sequence 2, Appli 

5 4287 100.0 826 3 US-09-235-217-2 Sequence 2, Appli 

6 4287 100.0 826 3 US-09-380-662-23 Sequence 23, Appl 

7 4287 100.0 826 4 US-09-438-833-1 Sequence 1, Appli 

8 4287 100.0 826 4 US-09-7O2-7O5-330 Sequence 330, App 

9 4287 100.0 826 4 US-09-736-457-330 Sequence 330, App 

10 4287 100.0 826 4 US-09-383-581-2 Sequence 2, Appli 

11 4287 100.0 826 4 US-09-614-124B-330 Sequence 330, App 

12 4287 100.0 826 4 US-09-671-325-330 Sequence 330, App 

13 4287 100.0 826 4 US-09-589-184-330 Sequence 330, App 

14 4287 100.0 826 4 US-09-658-824-330 Sequence 330, App 

15 4287 100.0 826 4 US-09-959-873B-18 Sequence 18, Appl 

16 4287 100.0 826 4 US-09-949-016-6089 Sequence 6089, Ap 

17 4287 100.0 826 4 US-09-967-388-4 Sequence 4, Appli 
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us-09-922-958-4. rai 


18 


4287 


100 


0 


826 


5 


PCT-US96-10251-2 


19 


4273.5 


99 


7 


827 


4 


US-09-919-039-149 


20 


4224 


98 


5 


813 


4 


US-09-438-833-12 


21 


4156 


96 


9 


805 


2 


US-08-480-473B-4 


22 


4156 


96 


9 


805 


3 


US-08-915-213-4 


23 


4156 


96 


9 


805 


3 


US-09-235-217-4 


24 


4156 


96 


9 


805 


5 


PCT-US96-10251-4 


25 


3939 


91.9 


756 


4 


US-09-438-833-11 


26 


3751 


87 


5 


810 


1 


US-08-785-241-7 


27 


3404 


79 


4 


652 


4 


US-09-438-833-5 


28 


3135 . 5 


73 


1 


623 


4 


US-09-967-388-2 


29 


3021.5 


70 


5 


613 


4 


US-09-438-833-6 


30 


2715 


63 


3 


532 


4 


US-09-949-016-7389 


31 


1942 


45 


3 


373 


2 


US-08-480-473B-3 


32 


1942 


45 


3 


373 


3 


US-08-915-213-3 


33 


1942 


45 


3 


373 


3 


US-09-235-217-3 


34 


1942 


45 


3 


373 


5 


PCT-US96-10251-3 


35 


1764. 5 


41 


2 


870 


1 


US-08-785-241-4 


36 


1764. 5 


41 


2 


870 


3 


US-09-374-454-6 


37 


1762 


41 


1 


330 


4 


US-09-438-833-4 


38 


1720 


40 


1 


875 


1 


US-08-785-241-5 


39 


1579 


36 


8 


311 


4 


US-09-438-833-7 


40 


1522 


35 


5 


301 


4 


US-09-438-833-10 


41 


1459 


34 


0 


288 


4 


US-09-438-833-9 


42 


1458 


34 


0 


485 


3 


US-09-374-454-4 


43 


1301 


30 


3 


245 


4 


US-09-438-833-3 


44 


1165 


27 


2 


238 


4 


US-09-949-016-9161 


45 


896 


20 


9 


1507 


4 


US-09-914-259-37 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



2, Appli 
149, App 
12, Appl 
4, Appli 
4, Appli 
4, Appli 

4, Appli 
11, Appl 
7, Appli 

5, Appli 

2, Appli 

6, Appli 
7389, Ap 

3, Appli 
Appli 
Appli 
Appl i 
Appl i 
Appl i 
Appli 
Appli 
Appli 

Appl 



3, 
3, 
3, 
4, 
6, 
4, 
5, 
7, 
10, 



9, Appli 
4, Appli 
3, Appli 
9161, Ap 
37, Appl 



ALIGNMENTS 



RESULT 1 
US-08-785-241-6 

; Sequence 6, Application US/08785241 

; Patent No. 5695963 

; GENERAL INFORMATION: 

; APPLICANT: McKnight, Steven L. 

; APPLICANT: Russell, David W. 

; APPLICANT: Tian, Hui 

; TITLE OF INVENTION: Endothelial PAS Domain Protein 

; NUMBER OF SEQUENCES: 7 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

; STREET: 268 BUSH STREET, SUITE 3200 

; CITY: SAN FRANCISCO 

; STATE: CALIFORNIA 

; COUNTRY: USA 

; ZIP: 94104 

; COMPUTER READABLE FORM: 

; medium TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; software: Patentln Release #1.0, version #1.30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/785 , 241 

; FILING DATE: 17-JAN-1997 

; classification: 435 

; attorney/agent information: 

name: osman, richard a 

; registration number: 36,627 

; reference/docket number: utsd:1229 
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us-09-922-958-4.rai 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415) 343-4341 

; TELEFAX: (415) 343-4342 

; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 826 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

; MOLECULE type: peptide 
US-08-785-241-6 



Query Match 100.0%; score 4287; DB 1; Length 826; 

Best Local similarity 100.0%; Pred. No. 0; 

Matches 826; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHOLPLPHNVSSHLDKASVM 


60 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 
* 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 




Db 


1 


MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 


60 


Qy 


61 


RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i t i i i i i i i i i i i i i i i i i i i i i I i i 1 i i I i i i i i i i i i 1 i i i i i i 1 i i i i i i i i i i i i i 




Pb 


61 


RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 


120 


Qy 


121 

JL C J- 


GL TOFFLTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEONTORSFFLRMKCTLTSRGR 

VJ l_ 1 \Lf 1 l_ l_ I V]| IbJVIL/I III! 1 IUUI ll\UI IL. 1 1 1 IXI t| XJ I— V IX IX VJ IX I— V£ 1 t| 1 V£ l\JI 1 L. IXI'IIX V. 1 l_ 1 *J 1 XVI l\ 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I 1 1 
1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 


180 


Qy 


181 


TMNIKSATWKVLHCTGHIHVYDTNSNOPOCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 

l I M i J» ix -J f \ i v w ix v li 1 v»_ i vj i i ± i i v i \j i i ^i i i v£V— vj i ix ix r i ri i v. i_ v w j- v- L— i jl. t iif i ^ j- i i_ l*/ i x 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 


240 


Qy 


741 


TPI QPM<;i nMKF^YmFRTTFI Mf^YFPFFI 1 (^R^TYFYYHAI n^nHI TKTHHHMFTIv'CirA/ 


J \J\J 




1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


241 


TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 


300 


Qy 


301 

J \J JL 


TTGOYRMLAKRGGYVWVFTOATVIYNTKNSOPOCIVCVNYWSGIIOHDLIFSLOOTECV 

1 1 VI V( 1 IXl 1 L-/»IX 1 x VJ VJ | V V* V L 1 x^/l 1 V 1 1 11 1 IX 1 1 1 \^\«X V V. VIII V V -mj VI J. J— V£ 1 IL/L> J> 1 *J 1— V£V£ ■ L_ V. V 


360 




1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 
ii ii i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 i i t i i i i i 




Db 


301 


TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 


360 


Qy 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 




1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 II 1 1 1 II II 1 1 1 II 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 


Qy 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 




1 1 1 II 1 1 1 1 1 1 1 II II 1 1 1 ! 1 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 Ml 1 II 1 M 1 1 1 1 1 1 1 




Db 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 


Qy 


481 


NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 




Ml 1 II 1 M MM II 1 1 1 II 1 1 M II 1 M 1 1 1 1 II 1 1 Mill 1 1 1 1 1 II III 1 II II II 1 




Db 


481 


NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 


Qy 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 


600 




1 1 Ml 1 1 II 1 1 1 M II 1 1 II 1 1 III 1 1 1 MM 1 1 II 1 M 1 1 1 1 1 1 II 1 M 1 1 II M M 1 1 


600 


Db 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 


Qy 


601 


VFQQTQIQE PTANATTTTATTDE LKTVTKDRMEDIKI LI AS PS PTHI HKETTSATSS PYR 


660 




Ml 1 1 1 1 1 1 III 1 1 1 1 1 1 M 1 1 1 1 II 1 M 1 1 1 1 II 1 1 1 II II 1 1 1 1 1 1 II 1 1 II II II 1 




Db 


601 


VFQQTQIQE PTANATTTTATTDE LKTVTKDRMEDIKI LIAS PS PTHIHKETTSATSS PYR 


660 


Qy 


661 


DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 




M 1 1 Ml MM 1 II 1 II 1 1 M 1 MM 1 1 Mill 1 II 1 1 1 1 1 1 1 1 1 II 1 III 1 1 1 M 1 1 1 1 




Db 


661 


DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 
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Qy 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

1 1 1 1 1 1 1 1 II M II 1 1 1 1 1 III 1 1 III 1 1 Ml 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II II I 1 1 1 1 1 

Db 721 KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 780 

Qy 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 II I !l I II M 1 1 1 1 1 1 1 

Db 781 RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 

RESULT 2 

US-08-480-473B-2 

Sequence 2, Application US/08480473B 
Patent No. 5882914 
GENERAL INFORMATION: 

APPLICANT: Semenza, Gregg L. 

TITLE OF INVENTION: HYPOXIA INDUCIBLE FACTOR-1 AND METHOD OF USE 
NUMBER OF SEQUENCES: 64 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.c. 
STREET: 4225 Executive Square, Suite 1400 
CITY: La Jolla 
STATE : " CA 
COUNTRY: USA 
ZIP: 92037 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/480 , 473B 
FILING DATE: 06-JUN-1995 
CLASSIFICATION: 514 
ATTORNEY/AGENT INFORMATION: 

NAME: Haile, Lisa A. 
REGISTRATION NUMBER: 38,347 
REFERENCE/DOCKET NUMBER: 07265/053001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 619/678-5070 
TELEFAX: 619/678-5099 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 826 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
molecule TYPE: protein 
US-08-480-473B-2 

Query Match 100.0%; Score 4287; DB 2; Length 826; 

Best Local similarity 100.0%; Pred. No. 0; 

Matches 826; conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

1 1 1 1 1 1 II 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 !! I II II I 1 1 1 1 1 1 1 1 1 1 1 1 1 !l 1 1 1 1 1 II 1 1! 1 1 1 

Db 1 MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSSHLDKASVM 60 

Qy 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

I M I II II I 1 1 1 1 1 1 1 1 1 II I 1 1 I I 1 1 II 111 1 1 I 1 1 1 1 I I I 1 1 1 1 I 1 1 1 1 I Ml I I II I 

Db 61 RLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYISDNVNKYM 120 

Qy 121 GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 180 

I Ml MM I II II ill II Ml II II II Mill III IN! Ml INI I II II I MM I III 
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Db 


121 


GLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKEQNTQRSFFLRMKCTLTSRGR 


180 


Qy 


181 


TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

llllllllllllllllllllllllllllllllllllllllllilllllllrillllllll 




Db 


181 


TMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLDSK 


240 


Qy 


241 


TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 




Db 


241 


TFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKGQV 


300 


Qy 


301 


TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 


360 




II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


TTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYWSGIIQHDLIFSLQQTECV 


360 


Qy 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 




Db 


361 


LKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDTET 


420 


Qy 


421 


DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 
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DDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKLEP 


480 


Qy 


481 


NPESLELSFTM POIODOTPSPSDGSTROSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 
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Db 


481 


NPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEKLF 


540 


Qy 


541 


AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLR5FDQLSPLESSSASPESASPQSTVT 


600 
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AEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQSTVT 


600 


Qy 


601 


VFOOTOIOEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 


660 
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Db 
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VFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYR 


660 


Qy 


661 


DTOSRTASPNRAGKGVIEOTEKSHPRSPNVLSVALSORTTVPEEELNPKILALONAORKR 


720 
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Db 
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DTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKR 


720 


Qy 


721 


KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 


780 
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KMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLAC 


780 


Qy 


781 


RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 
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Db 
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RLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN 826 
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us-09-922-958-5.rai 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: July 27, 2005, 07:43:11 ; Search time 3.68182 Seconds 

(without alignments) 
1094.853 Million cell updates/sec 

Title: US-09-922-958-5 
Perfect score: 275 

Sequence: 1 KLELVEKLFAEDTEAKNPFS IPMDDDFQLRSFDQLSPLES 54 

scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 513545 seqs, 74649064 residues 

Total number of hits satisfying chosen parameters: 513545 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : lssued_Patents^A: * 



/cgn2_6/ptodata/l/i aa/5A_COMB . pep : * 
/cgn2_6/ptodata/l/i aa/5 B_COMB . pep : * 
/cgn2_6/ptodata/l/i aa/6A^COMB . pep : * 
/cgn2_6/ptodata/l/i aa/6B_C0MB . pep : * 
/cgn2_6/ptodata/l/i aa/PCTUS_C0MB . pep : * 
/cgn2_6/ptodata/l/i aa/backf i 1 esl. pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length DB 


ID 


Descri ption 


1 


275 


100.0 


116 


4 


US-09-438-833-8 


Sequence 


8, 


2 


275 


100. 


0 


288 


4 


US-09-438-833-9 


Sequence 


9, 


3 


275 


100. 


0 


301 


4 


US-09-438-833-10 


Sequence 


10 


4 


275 


100. 


0 


311 


4 


US-09-438-833-7 


Sequence 


7, 


5 


275 


100 


0 


532 


4 


US-09-949-016-7389 


Sequence 


73 


6 


275 


100 


0 


613 


4 


US-09-438-833-6 


Sequence 


6, 


7 


275 


100 


0 


652 


4 


US-09-438-833-5 


Sequence 


5, 


8 


275 


100 


0 


756 


4 


US-09-438-833-11 


Sequence 


11 


9 


275 


100 


0 


805 


2 


US-08-480-473B-4 


Sequence 


4, 


10 


275 


100 


0 


805 


3 


US-08-915-213-4 


Sequence 


4, 


11 


275 


100 


0 


805 


3 


US-09-235-217-4 


Sequence 


4, 


12 


275 


100 


0 


805 


5 


PCT-US96-10251-4 


Sequence 


4, 


13 


275 


100 


0 


810 


1 


US-08-785-241-7 


Sequence 


7, 


14 


275 


100 


0 


813 


4 


US-09-438-833-12 


Sequence 


12 


15 


275 


100 


0 


826 


1 


US-08-785-241-6 


Sequence 


6, 


16 


275 


100 


0 


826 


2 


US-08-480-473B-2 


Sequence 


2, 


17 


275 


100 


0 


826 


3 


US-08-915-213-2 


Sequence 


2, 
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275 


100.0 


826 


3 


19 


275 


100.0 


826 


3 


20 


275 


100.0 


826 


3 


21 


275 


100.0 


826 


4 


22 


275 


100.0 


826 


4 


23 


275 


100.0 


826 


4 


24 


275 


100.0 


826 


4 


25 


275 


100.0 


826 


4 


26 


275 


100.0 


826 


4 


27 


275 


100.0 


826 


4 


28 


275 


100.0 


826 


4 


29 


275 


100.0 


826 


4 


30 


275 


100.0 


826 


4 


31 


275 


100.0 


826 


4 


32 


275 


100.0 


826 


5 


33 


275 


100.0 


827 


4 


34 


271 


98. 5 


54 


4 


35 


180 


65. 5 


34 


4 


36 


141. 5 


51. 5 


875 


1 


37 


139. 5 


50.7 


205 


3 


38 


139.5 


50.7 


870 


1 


39 


139.5 


50.7 


870 


3 


40 


101 


36.7 


19 


4 


41 


89 


32.4 


19 


4 


42 


64.5 


23.5 


143 


4 


43 


64.5 


23.5 


1507 


4 


44 


63 


22.9 


440 


4 


45 


62 


22.5 


335 


4 



us-09-922-958-5.rai 

US-09-148-547-2 Sequence 2, Appli 

US-09-235-217-2 Sequence 2, Appli 

US-09-380-662-23 Sequence 23, Appl 

US-09-438-833-1 Sequence 1, Appli 

US-09-702-705-330 Sequence 330, App 

US-09-736-457-330 Sequence 330, App 

US-09-383-581-2 Sequence 2, Appli 

US-09-614-124B-330 Sequence 330, App 

US-09-671-325-330 Sequence 330, App 

US-09-589-184-330 Sequence 330, App 

US-09-658-824-330 Sequence 330, App 

US-09-959-873B-18 Sequence 18, Appl 

US-09-949-016-6089 Sequence 6089, Ap 

US-09-967-388-4 Sequence 4, Appli 

PCT-US96-10251-2 Sequence 2, Appli 



US-09-919-039-149 Sequence 149, App 

>pl 

US-09-959-8738-9 Sequence 9, Appli 



US-09-438-833-13 Sequence 13, Ape 



US-08-785-241-5 Sequence 5, Appli 

US-09-374-454-2 Sequence 2, Appli 

US-08-785-241-4 Sequence 4, Appli 

US-09-374-454-6 Sequence 6, Appli 

US-09-959-873Bt8 Sequence 8, Appli 

us-09-972-784-5 Sequence 5, Appli 

US-09-270-767-42631 Sequence 42631, A 

US-09-914-259-37 Sequence 37, Appl 

US-09-538-092-1264 Sequence 1264, Ap 

US-09-328-352-7832 Sequence 7832, Ap 



ALIGNMENTS 



RESULT 1 
US-09-438-833-8 

; Sequence 8, Application US/09438833 

; Patent No. 6436654 

; GENERAL INFORMATION: 

; APPLICANT: Pharmacia & Upjohn 

; TITLE OF INVENTION: Protein variants 

; FILE REFERENCE: 1848 

; CURRENT APPLICATION NUMBER: US/09/438 , 833 
; CURRENT FILING DATE: 1999-11-12 
; NUMBER OF SEQ ID NOS : 15 

; SOFTWARE: Patentin Ver. 2.1 

; SEQ ID NO 8 
; LENGTH: 116 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE: 

; OTHER INFORMATION: Description of Artificial Sequence: Subdomain 
; OTHER INFORMATION: 526-641 of human HIF-1 alpha 
US-09-438-833-8 

Query Match 100.0%; Score 275; DB 4; Length 116; 

Best Local similarity 100.0%; Pred. No. 2.5e-30; 

Matches 54 ; Conservati ve 0 ; Mi smatches 0 ; Indel s 0 ; Gaps 0 ; 

Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

I I I I 11 I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 60 
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us-09-922-958-5.rai 

RESULT 2 
US-09-438-833-9 

; Sequence 9, Application US/09438833 

; Patent No. 6436654 

; GENERAL INFORMATION: 

; APPLICANT: Pharmacia & Upjohn 

; TITLE OF INVENTION: Protein variants 

; FILE REFERENCE: 1848 

; CURRENT APPLICATION NUMBER: US/09/438 , 833 
; CURRENT FILING DATE: 1999-11-12 
; NUMBER OF SEQ ID NOS: 15 

; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 9 
; LENGTH: 288 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE: 

; OTHER INFORMATION: Description of Artificial Sequence: Subdomain 
; OTHER INFORMATION: 526-813 of human HIF-1 alpha 
US-09-438-833-9 

Query Match 100.0%; Score 275; DB 4; Length 288; 

Best Local similarity 100.0%; Pred. No. 8.1e-30; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

M I 1 1 1 1 1 1 1 1 I I I I 1 1 I I! I 1 1 I I I I I 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I M I I 

Db 7 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 60 



RESULT 3 

US-09-438-833-10 

; sequence 10, Application US/09438833 

; Patent No. 6436654 

; GENERAL INFORMATION: 

; APPLICANT: Pharmacia & Upjohn 

; TITLE OF INVENTION: Protein variants 

; FILE REFERENCE: 1848 

; CURRENT APPLICATION NUMBER: US/09/438 , 833 
; CURRENT FILING DATE: 1999-11-12 
; NUMBER OF SEQ ID NOS: 15 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 10 
; LENGTH: 301 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE: 

; OTHER INFORMATION: Description of Artificial Sequence: subdomain 
; OTHER INFORMATION: 526-826 of human HIF-1 alpha 
US-09-438-833-10 



Query Match 100.0%; Score 275; DB 4; Length 301; 

Best Local similarity 100.0%; Pred. No. 8.6e-30; 

Matches 54; Conservative 0; Mismatches 0; Indels 0; Gaps 
Qy 1 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 54 

' I I I I I I I ! I I I I I I I I I I I I I I Mi I I I I I I I I! I I I I I ! I I I I I I Ml I I I I 

Db 7 KLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLES 60 
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